One of the most challenging open problems in quantum information theory is to clarify and
quantify how entanglement behaves when part of an entangled state is sent through a quantum 
channel. Of central importance in the description of a quantum channel or completely positive map 
(CP-map) is the dual state associated to it. The present paper is a collection of well-known, less 
known and new results on quantum channels, presented in a unified way. We will show how this 
dual state induces nice characterizations of the extremal maps of the convex set of CP-maps, and 
how normal forms for states defined on a Hilbert space with a tensor product structure lead to 
interesting parameterizations of quantum channels.

stones on which the newly established field of quantum information theory is built. The main gain of quantum over
classical information processing stems from the fact that we are allowed to perform operations on entangled states:
through the quantum correlations, an operation on a part of the system affects the whole system. One of the most
challenging open problems is to clarify and quantify how entanglement behaves when part of an entangled state is
sent through a quantum channel.

Of central importance in the description of a quantum channel or completely positive map (CP-map) is the dual
state associated to it. This state is defined over the tensor product of the Hilbert space itself (the input of the channel)
with another one of the same dimension (the output of the channel). It is clear that there appears a natural tensor
product structure, and indeed the notion of entanglement will be crucial in the description of quantum channels.

In a typical quantum information setting, Alice wants to send one qubit (possibly entangled with other qubits)
to Bob through a quantum channel. The channel acts linearly on the input state, and the consistency of quantum
mechanics dictates that this map be completely positive (CP). This implies that the map is of the form [4]

\[ \Phi(\rho) = \sum_i A_i \rho A_i^\dagger . \]

Moreover the map is trace-preserving if no loss of the particle can occur. A natural way of describing the class of
CP-maps is by using the duality between maps and states, first observed by Jamiolkowski and since then rediscovered
by many. We review some nice properties of CP-maps based on this dual description, and show how to obtain the
extreme points of the convex set of trace-preserving CP-maps.

The dual state is defined on a Hilbert space that is the tensor product of two times the original Hilbert space on 
which the map acts, and is therefore naturally endowed with a notion of entanglement. Unitary evolution for example 
corresponds to maximal correlations between the in- and output state, and this kind of evolution leads to a dual 
state that is maximally entangled. We will show how normal forms derived for entangled states lead to interesting 
parameterizations of CP-maps, and will discuss some issues concerning the use of quantum channels to distribute 
entanglement. 

It thus turns out that the techniques developed for describing entanglement can directly be applied for describing 
the evolution of a quantum system. Concepts such as quantum steering and teleportation have a direct counterpart. A
quantum channel for example will be useful for distributing entanglement if and only if the dual state associated to it 
is entangled, and optimal decompositions of states as derived in the case of entanglement of formation will yield very 
appealing parameterizations of quantum channels. 



I. CHARACTERIZATION OF CP-MAPS 

The most general evolution of a quantum system is described by a linear CP-map. In this section we will give
a self-contained description of CP-maps or quantum channels. Most of the mathematics presented originates from the
seminal papers of de Pillis [6] and Choi. The fact that the evolution of quantum systems is described by linear
completely positive maps is a consequence of the assumption of the linearity of the evolution (the complete positivity
follows from consistency arguments once the linearity is accepted).

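Let us now recall some notation and useful tricks. Consider a pure state $|\chi\rangle$ in a Hilbert space that is a tensor
product of two Hilbert spaces of dimension $n$:

\[ |\chi\rangle = \sum_{i,j=1}^{n} \chi_{ij}\, |i\rangle|j\rangle . \]

Define

\[ |I\rangle = \sum_{i=1}^{n} |i\rangle|i\rangle \]

an unnormalized maximally entangled state and $A$ the operator with elements $A_{ij} = \chi_{ij}$; then, writing $|A\rangle$ for the
state $|\chi\rangle$,

\[ |A\rangle = A \otimes I_n\, |I\rangle . \]

Moreover it holds that

\[ X \otimes Y\, |A\rangle = XA \otimes Y\, |I\rangle = XAY^T \otimes I_n\, |I\rangle = I_n \otimes YA^TX^T\, |I\rangle . \]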
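
The identities above are easy to check numerically. The following is a minimal sketch in plain Python (no external libraries); the helper names `matmul`, `kron`, `vec_I` and the test matrices are our own illustrative choices, and vectors are stored as single-column matrices with the first tensor factor as the slow index.

```python
# Sketch: check |A> = (A ⊗ I)|I> and
# (X ⊗ Y)|A> = (X A Y^T ⊗ I)|I> = (I ⊗ Y A^T X^T)|I> for arbitrary 2x2 matrices.

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
            for i in range(len(a))]

def transpose(a):
    return [list(row) for row in zip(*a)]

def kron(a, b):
    # Row index (i, k) -> i*len(b)+k, column index (j, l) -> j*len(b[0])+l.
    return [[a[i][j] * b[k][l] for j in range(len(a[0])) for l in range(len(b[0]))]
            for i in range(len(a)) for k in range(len(b))]

def identity(n):
    return [[1 if i == j else 0 for j in range(n)] for i in range(n)]

def vec_I(n):
    # Unnormalized maximally entangled state |I> = sum_i |i>|i> as a column vector.
    return [[1 if i == j else 0] for i in range(n) for j in range(n)]

def max_abs_diff(a, b):
    return max(abs(x - y) for ra, rb in zip(a, b) for x, y in zip(ra, rb))

n = 2
A = [[1, 2j], [3, 4]]      # arbitrary illustrative matrices
X = [[0, 1], [1j, 2]]
Y = [[2, -1], [0, 1j]]
I_n, ket_I = identity(n), vec_I(n)

ket_A = matmul(kron(A, I_n), ket_I)            # components are A_ij, row-major
lhs = matmul(kron(X, Y), ket_A)
mid = matmul(kron(matmul(matmul(X, A), transpose(Y)), I_n), ket_I)
rhs = matmul(kron(I_n, matmul(matmul(Y, transpose(A)), transpose(X))), ket_I)

print(max_abs_diff(lhs, mid), max_abs_diff(lhs, rhs))
```

Note that the transposes in the identity are plain transposes, not Hermitian conjugates, which is why complex test matrices are used in the sketch.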

The symbol $|I\rangle$ will solely be used to denote the unnormalized maximally entangled state $|I\rangle = \sum_i |ii\rangle$. We are
now ready for the following fundamental Theorem of de Pillis [6]:

Theorem 1 A linear map $\Phi$ acting on a matrix $X$ is Hermitian-preserving if and only if there exist operators $\{A_i\}$
and real numbers $\lambda_i$ such that

\[ \Phi(X) = \sum_i \lambda_i A_i X A_i^\dagger . \]

Proof: Suppose the map $\Phi$ acts on an $n \times n$ matrix. Then due to linearity, $\Phi$ is completely characterized if we know
how it acts on a complete basis of $n \times n$ matrices, for example on all matrices $|e_i\rangle\langle e_j|$, $1 \le i,j \le n$, with $\{|e_i\rangle\}$ a complete
orthonormal basis of the Hilbert space. Let us define the $n^2 \times n^2$ positive matrix

\[ |I\rangle\langle I| = \begin{pmatrix} |e_1\rangle\langle e_1| & \cdots & |e_1\rangle\langle e_n| \\ \vdots & & \vdots \\ |e_n\rangle\langle e_1| & \cdots & |e_n\rangle\langle e_n| \end{pmatrix} \tag{1} \]

being the matrix notation of a maximally entangled state in an $n \otimes n$ Hilbert space. It follows that all the information
of a map $\Phi$ is encoded in the state

\[ \rho_\Phi = I_n \otimes \Phi\,(|I\rangle\langle I|), \tag{2} \]

as the $n^2$ blocks of size $n \times n$ represent exactly the action of the map on the complete basis $|e_i\rangle\langle e_j|$. If $\Phi$ is
Hermitian-preserving, then $\Phi(|e_i\rangle\langle e_j|)$ has to be equal to the Hermitian conjugate of $\Phi(|e_j\rangle\langle e_i|)$, and this implies that $\rho_\Phi$ is
Hermitian. Let us therefore consider the eigenvalue decomposition $\rho_\Phi = \sum_i \lambda_i |\chi_i\rangle\langle\chi_i|$. Using the trick
$|A\rangle = (A \otimes I)|I\rangle$, we easily arrive at the conclusion that $\Phi(X) = \sum_i \lambda_i A_i X A_i^\dagger$, where $\{\lambda_i\}$ are the eigenvalues and where
the operators $\{A_i\}$ are the reshaped versions of the eigenvectors of $\rho_\Phi$. □

A central ingredient in the proof was the introduction of the matrix

\[ \rho_\Phi = I_n \otimes \Phi\,(|I\rangle\langle I|) \]

with $|I\rangle = \sum_i |ii\rangle$ a maximally entangled state. We define this Hermitian matrix $\rho_\Phi$ as being the dual state
corresponding to the map $\Phi$. It was already explained that it encodes all the information about the map, and its
eigenvectors give rise to the operators $A_i$. The above theorem characterizes all possible Hermitian-preserving maps,
and therefore surely all positive and completely positive maps. For example, let us consider the positive map that
corresponds to taking the transpose of the density operators of a qubit:
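
As a minimal numerical sketch of this example (plain Python; the function names are ours, and the block layout of equation (2) with the first tensor factor as the slow index is assumed), the dual state of the qubit transpose map can be assembled block by block. Under these assumptions it comes out as the swap operator, which is Hermitian but has a negative expectation value on the singlet, so the map is Hermitian-preserving without being completely positive.

```python
# Sketch: dual state rho_Phi = sum_ij |i><j| ⊗ Phi(|i><j|) of the qubit transpose map.

def dual_state(phi, n):
    rho = [[0] * (n * n) for _ in range(n * n)]
    for i in range(n):
        for j in range(n):
            # Matrix unit |i><j| and its image under the map.
            e_ij = [[1 if (r, c) == (i, j) else 0 for c in range(n)] for r in range(n)]
            block = phi(e_ij)
            for r in range(n):
                for c in range(n):
                    rho[i * n + r][j * n + c] = block[r][c]
    return rho

def transpose_map(m):
    return [list(row) for row in zip(*m)]

def quadratic_form(v, rho):
    # <v|rho|v> for a real vector v.
    return sum(v[r] * rho[r][c] * v[c] for r in range(len(v)) for c in range(len(v)))

rho_T = dual_state(transpose_map, 2)

singlet = [0, 1, -1, 0]      # unnormalized |01> - |10>, squared norm 2
triplet = [0, 1, 1, 0]       # unnormalized |01> + |10>, squared norm 2

for row in rho_T:
    print(row)
print(quadratic_form(singlet, rho_T) / 2, quadratic_form(triplet, rho_T) / 2)
```

The two printed expectation values are the eigenvalues of the swap operator on the antisymmetric and symmetric vectors respectively.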
Not all Hermitian-preserving maps are physical in quantum mechanics however: if a map acts on a subsystem,
then it should conserve positivity of the complete density operator. This extra assumption leads to the condition of
complete positivity, meaning that $I_m \otimes \Phi$ is positive for all $m$. Of course, this implies that the dual state $\rho_\Phi$ is not
only Hermitian but also positive (i.e. all its eigenvalues are nonnegative), as it is defined as the action of the map $I_n \otimes \Phi$
on a maximally entangled state. The positive eigenvalues can then be absorbed into the (Kraus) operators $\{A_i\}$, and
we have therefore proven the Kraus representation Theorem (Choi):

Theorem 2 A linear map $\Phi$ acting on a density operator $\rho$ is completely positive if and only if there exist operators
$\{A_i\}$ such that

\[ \Phi(\rho) = \sum_i A_i \rho A_i^\dagger . \]

Remarks: 

• A CP-map is trace-preserving iff $\sum_i A_i^\dagger A_i = I_n$; this property is easily verified using the cyclicity of the trace.
In terms of the (unique) dual state $\rho_\Phi$ associated to the map this trace-preserving condition amounts to:

\[ \mathrm{Tr}_2(\rho_\Phi) = I_n . \]

Here the notation $\mathrm{Tr}_2$ means the partial trace over the second subsystem. A CP-map is furthermore called
bistochastic if also the condition

\[ \mathrm{Tr}_1(\rho_\Phi) = I_n \]

holds; this property is equivalent to the fact that the map is identity-preserving, i.e. $\Phi(I_n) = I_n$.

• The dual state $\rho_\Phi$ corresponding to a CP-map $\Phi$ is uniquely defined. The Kraus operators are obtained by
considering the columns of a square root of $\rho_\Phi$ ($A_i$ is obtained by making a matrix out of the $i$'th column of a
square root $X$, with $\rho_\Phi = XX^\dagger$). As the square root of a matrix is not uniquely defined, the Kraus operators
are not unique. Each different "square root" $X$ of $\rho_\Phi$ ($\rho_\Phi = XX^\dagger$) gives rise to a different set of equivalent
Kraus operators. This implies that all equivalent sets of Kraus operators are related by an isometry, and that
the minimal number of Kraus operators is given by the rank of the density operator $\rho_\Phi$. Therefore we define the
rank of a map to be the rank of the dual operator $\rho_\Phi$. This rank is bounded above by $n^2$, with $n$ the dimension of
the Hilbert space. A unique Kraus representation can be obtained by, for example, enforcing the Kraus operators
to be orthogonal, as these would correspond to the unique eigenvectors of $\rho_\Phi$. Note that a similar reasoning
applies to all Hermitian-preserving and all positive maps, although there an additional sign should be taken into
account.

• By construction, we have proven that a map $\Phi$ acting on an $n$-dimensional Hilbert space is completely positive
iff $I_n \otimes \Phi$ is positive: there is no need to consider auxiliary Hilbert spaces with dimension larger than the
original one. The reasoning is as follows: if $I_n \otimes \Phi$ is positive, then $\rho_\Phi$ is positive, and therefore $\Phi$ has a Kraus
representation, which implies complete positivity.

• Suppose $\Phi$ is positive but not completely positive. Then there exists a completely positive map $\tilde\Phi$ and a positive
scalar $\epsilon$ such that

\[ \Phi(\rho) = (1 + n\epsilon)\,\tilde\Phi(\rho) - \epsilon\,\mathrm{Tr}(\rho)\, I_n . \]

The proof of this fact is elementary: take $\epsilon$ to be the opposite of the smallest eigenvalue of $\rho_\Phi$ (this eigenvalue is
negative as otherwise $\Phi$ would be completely positive), and define the CP-map $\tilde\Phi(\rho) = (\Phi(\rho) + n\epsilon\,\mathrm{Tr}(\rho)\, I_n/n)/(1 + n\epsilon)$
(this map is completely positive because the dual state $\rho_{\tilde\Phi}$ associated to it is positive and has therefore a
Kraus representation). Note that the whole reasoning is also valid for general Hermitian-preserving maps. As
an example, consider again the transpose map on a qubit. Then it can be checked that the minimal value of $\epsilon$
is 1 (this is true for the PT operation in arbitrary dimensions) and that the Kraus operators corresponding to
$\tilde\Phi$ become
• To make the duality between maps and states more explicit, it is useful to consider the following identity:

\[ \Phi(\rho) = \mathrm{Tr}_1\!\left( \rho_\Phi^{T_1} (\rho \otimes I_n) \right), \tag{7} \]

where $T_1$ means partial transposition with respect to the first subsystem. This can be proven by explicitly
writing the map $\Phi$ in Kraus operator form, and exploiting the cyclicity of the trace. Due to the partial
transpose condition of Peres, it is clear that $\rho_\Phi^{T_1}$ will typically no longer be positive. This identity is very
useful, and was used in the section on optimal teleportation with mixed states.
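
The remarks above can be checked on a concrete channel. The sketch below (plain Python) uses the qubit amplitude damping channel with damping parameter `g` as an illustrative choice of ours; it builds the dual state from the Kraus operators, tests the trace-preserving and bistochastic conditions, and compares the direct action of the map with the duality identity, taking the partial trace over the first (input) subsystem as assumed in our reading of the block layout of equation (2).

```python
import math

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
            for i in range(len(a))]

def dagger(a):
    return [[complex(a[j][i]).conjugate() for j in range(len(a))] for i in range(len(a[0]))]

def apply_channel(kraus, rho):
    n = len(rho)
    out = [[0j] * n for _ in range(n)]
    for A in kraus:
        term = matmul(matmul(A, rho), dagger(A))
        out = [[out[r][c] + term[r][c] for c in range(n)] for r in range(n)]
    return out

def dual_state(kraus, n):
    # rho_Phi = sum_ij |i><j| ⊗ Phi(|i><j|); first factor is the channel input.
    rho = [[0j] * (n * n) for _ in range(n * n)]
    for i in range(n):
        for j in range(n):
            e_ij = [[1 if (r, c) == (i, j) else 0 for c in range(n)] for r in range(n)]
            block = apply_channel(kraus, e_ij)
            for r in range(n):
                for c in range(n):
                    rho[i * n + r][j * n + c] = block[r][c]
    return rho

def ptrace_second(rho, n):
    return [[sum(rho[i * n + r][j * n + r] for r in range(n)) for j in range(n)] for i in range(n)]

def ptrace_first(rho, n):
    return [[sum(rho[i * n + r][i * n + c] for i in range(n)) for c in range(n)] for r in range(n)]

def ptranspose_first(rho, n):
    return [[rho[j * n + r][i * n + c] for j in range(n) for c in range(n)]
            for i in range(n) for r in range(n)]

def kron(a, b):
    return [[a[i][j] * b[k][l] for j in range(len(a[0])) for l in range(len(b[0]))]
            for i in range(len(a)) for k in range(len(b))]

def identity(n):
    return [[1 if i == j else 0 for j in range(n)] for i in range(n)]

def max_abs_diff(a, b):
    return max(abs(x - y) for ra, rb in zip(a, b) for x, y in zip(ra, rb))

n, g = 2, 0.25                                   # g: illustrative damping parameter
kraus = [[[1, 0], [0, math.sqrt(1 - g)]], [[0, math.sqrt(g)], [0, 0]]]

rho_phi = dual_state(kraus, n)
tp_check = ptrace_second(rho_phi, n)             # should be I_n (trace-preserving)
phi_of_identity = ptrace_first(rho_phi, n)       # equals Phi(I_n); I_n only if bistochastic

rho_in = [[0.7, 0.2 - 0.1j], [0.2 + 0.1j, 0.3]]
direct = apply_channel(kraus, rho_in)
via_dual = ptrace_first(matmul(ptranspose_first(rho_phi, n), kron(rho_in, identity(n))), n)

print(max_abs_diff(tp_check, identity(n)), max_abs_diff(direct, via_dual))
```

In this example the channel is trace-preserving but not bistochastic, since the image of the identity is diag(1 + g, 1 - g).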



II. EXTREME POINTS OF CP-MAPS 



The set of completely positive maps is a convex set: indeed, if $\Phi_1$ and $\Phi_2$ are CP-maps, then so is $x\Phi_1 + (1 - x)\Phi_2$.
Due to the one-to-one correspondence between maps $\Phi$ and states $\rho_\Phi$, it is trivial to obtain the extreme points of the
set of completely positive maps: these are the maps with one Kraus operator, corresponding to $\rho_\Phi$ having rank 1.

If however we consider the convex set of trace-preserving maps, the characterization of extreme points becomes 
more complicated. The knowledge of the set of extreme points of the trace-preserving CP-maps is very interesting 
from a physical perspective in the following way: suppose one has a multipartite state of qudits and one wants to 
maximize some convex functional of the state (e.g. the fidelity, ...) by performing local operations. Due to convexity, 
the optimal operation will correspond to an extreme point of the set of trace-preserving maps. 

Let us now characterize all extremal trace-preserving maps: 

Theorem 3 Consider a TPCP-map $\Phi$ acting on a Hilbert space of dimension $n$ and of rank $m$. Consider the dual
state $\rho_\Phi = XX^\dagger$ with $X$ an $n^2 \times m$ matrix, and the $n^2$ matrices $X_i = X^\dagger(\sigma_i \otimes I_n)X$ (the matrices $\{\sigma_i\}$ form a complete
basis for the Hermitian $n \times n$ matrices). Then $\Phi$ is extremal if and only if $m \le n$ and if the set of linear equations
$\forall i: \mathrm{Tr}(Q X_i) = 0$ has only the trivial solution $Q = 0$.

This condition is equivalent to the following one given by Choi: given $m$ Kraus operators $\{A_i\}$ of a map $\Phi$, then
the map is extremal iff the $m^2$ matrices $\{A_i^\dagger A_j\}$, $1 \le i,j \le m$, are linearly independent.

Proof: The map $\Phi$ is extremal if and only if there does not exist an $R$ with the property that $RR^\dagger \neq I$ and such that
$\mathrm{Tr}_2(X RR^\dagger X^\dagger) = I$. This condition is equivalent to the fact that the set of equations

\[ \mathrm{Tr}\Big( X \big(RR^\dagger - I\big) X^\dagger \, (\sigma_i \otimes I) \Big) = 0, \qquad Q = RR^\dagger - I, \]

does only have the trivial solution $Q = 0$. As there are $n^2$ independent generators $\sigma_i$ and due to the fact that $Q$ has
$m^2$ degrees of freedom, it is immediately clear that there will always be a non-trivial solution if $m > n$, ending the
proof.

It remains to be proven that the condition obtained is equivalent to the one derived by Choi. This can be seen
as follows: the condition $\mathrm{Tr}_2(XRR^\dagger X^\dagger) = I$ is equivalent to the condition $\sum_{jk} A_j^\dagger A_k \big(\sum_i R_{ji}R^*_{ki} - \delta_{jk}\big) = 0$ (this is
readily obtained using the trick $|A\rangle = A \otimes I\,|I\rangle$). Therefore a nontrivial solution $Q$ is possible iff the set of matrices
$\{A_i^\dagger A_j\}$, $1 \le i,j \le m$, is linearly dependent. □
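
Choi's criterion lends itself to a direct numerical test. The following sketch (plain Python) flattens the $m^2$ products $A_i^\dagger A_j$ into row vectors and computes their rank by Gaussian elimination; the example channels (amplitude damping, dephasing, a single unitary), the tolerance `tol` and all function names are illustrative assumptions of ours, and the Kraus operators are assumed to be linearly independent and trace-preserving.

```python
import math

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
            for i in range(len(a))]

def dagger(a):
    return [[complex(a[j][i]).conjugate() for j in range(len(a))] for i in range(len(a[0]))]

def matrix_rank(rows, tol=1e-9):
    # Gauss-Jordan elimination with partial pivoting over the complex numbers.
    rows = [[complex(v) for v in r] for r in rows]
    count = 0
    for col in range(len(rows[0])):
        pivot = max(range(count, len(rows)), key=lambda r: abs(rows[r][col]))
        if abs(rows[pivot][col]) < tol:
            continue
        rows[count], rows[pivot] = rows[pivot], rows[count]
        p = rows[count][col]
        rows[count] = [v / p for v in rows[count]]
        for r in range(len(rows)):
            if r != count:
                f = rows[r][col]
                rows[r] = [a - f * b for a, b in zip(rows[r], rows[count])]
        count += 1
        if count == len(rows):
            break
    return count

def is_extremal(kraus):
    # Choi: extremal iff the m^2 matrices A_i^dagger A_j are linearly independent.
    products = [matmul(dagger(a), b) for a in kraus for b in kraus]
    flat = [[v for row in p for v in row] for p in products]
    return matrix_rank(flat) == len(kraus) ** 2

def amplitude_damping(g):
    return [[[1, 0], [0, math.sqrt(1 - g)]], [[0, math.sqrt(g)], [0, 0]]]

def dephasing(p):
    a, b = math.sqrt(1 - p), math.sqrt(p)
    return [[[a, 0], [0, a]], [[b, 0], [0, -b]]]

pauli_x = [[[0, 1], [1, 0]]]

print(is_extremal(amplitude_damping(0.5)))   # rank-2 qubit channel
print(is_extremal(dephasing(0.3)))           # mixture of two unitaries
print(is_extremal(pauli_x))                  # unitary, rank 1
```

The dephasing channel fails the test because $A_0^\dagger A_0$ and $A_1^\dagger A_1$ are both proportional to the identity, in line with the fact that it is a convex mixture of two unitary maps.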

Note that the given proof is constructive and can therefore be used for decomposing a given TPCP-map into a
convex combination of extremal maps: once a non-trivial $Q$ and therefore $R$ is obtained, one can scale it such that
$RR^\dagger \le I$, and define another $S = \sqrt{I - RR^\dagger}$. This $S$ is guaranteed to yield another trace-preserving map up to a
constant factor, and the original map is the sum of the maps parameterized by $XRR^\dagger X^\dagger$ and $XSS^\dagger X^\dagger$.

All TPCP maps $\Phi$ of rank 1 are of course extreme and correspond to unitary dynamics. One easily verifies that this
implies that the dual $\rho_\Phi$ is a maximally entangled state. The intuition behind this is as follows: by construction, $\rho_\Phi$
characterizes the correlation between the output and the input of the channel. Maximal correlation happens iff the
evolution occurs reversibly and thus unitarily, and therefore corresponds to maximal "entanglement" between in- and
output. We will explore this connection between maps and entanglement more thoroughly in the following section.

One could go one step further, and try to characterize all extreme points of the convex set defined by all
trace-preserving channels for which the extra condition holds that $\Phi(\rho_1) = \rho_2$ with $\rho_1$ and $\rho_2$ given density operators. (Note
that $\rho_1$ and $\rho_2$ can be chosen completely arbitrarily, as there will always exist at least one TPCP-map that transforms
a given state into another given one: consider for example the map with its associated dual state $\rho_\Phi = I \otimes \rho_2$.)
Bistochastic channels are a special subset of this convex set of maps (in that case $\rho_1 = \rho_2 \propto I$). An adaptation of
Theorem 3 leads to the following:
Theorem 4 Consider the convex set of trace-preserving CP-maps $\Phi$ for which $\Phi(\rho_1) = \rho_2$ with $\rho_1, \rho_2$ given. Suppose
$\Phi$ is of rank $m$, its dual state is $\rho_\Phi = XX^\dagger$ with $X$ an $n^2 \times m$ matrix, and that there are $m$ Kraus operators $\{A_i\}$. Then
this map is extremal if and only if the set of $2n^2$ linear equations

\[ \mathrm{Tr}\big(Q X^\dagger(\sigma_i \otimes I)X\big) = \mathrm{Tr}\big(Q X^\dagger(\rho_1^T \otimes \sigma_i)X\big) = 0 \tag{8} \]

has only the trivial solution $Q = 0$, or equivalently if and only if the $m^2$ operators $\{A_i^\dagger A_j \oplus A_j \rho_1 A_i^\dagger\}$ ($1 \le i,j \le m$)
are linearly independent.

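Proof: The proof is completely analogous to the proof of Theorem 3, but here we have the extra condition

\[ \mathrm{Tr}\Big( X \big(RR^\dagger - I\big) X^\dagger (\rho_1^T \otimes \sigma_i) \Big) = 0 . \]

In terms of Kraus operators, this additional condition becomes

\[ \sum_{kj} A_k \rho_1 A_j^\dagger \Big( \sum_i R_{ji} R^*_{ki} - \delta_{jk} \Big) = 0, \]

which ends the proof. □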

A similar Theorem was stated by Landau and Streater in the special case of bistochastic maps. In analogy with
the conclusions of Theorem 3 we conclude that the number of Kraus operators in an extremal TPCP-map of the kind
considered in the above Theorem is bounded by $\lfloor \sqrt{2n^2} \rfloor$.

Let us for example consider the case of qubits. Then the rank of an extremal $\Phi$ is bounded by 2, and extremal rank
2 TPCP-maps obeying the condition $\Phi(\rho_1) = \rho_2$ typically exist. There is however a notable exception if $\rho_1 = \rho_2 = I/2$
(i.e. when $\Phi$ is bistochastic): a bistochastic qubit map has a corresponding dual $\rho_\Phi$ that is Bell-diagonal. A
Bell-diagonal state is a convex sum of maximally entangled states, and therefore a rank 2 bistochastic map cannot be
extremal. Note however that this is an accident, and for Hilbert space dimensions larger than 2 there exist extremal
bistochastic channels that are not unitary. Sometimes the name "unital" is also used instead of "bistochastic".
The foregoing argument however shows that this terminology is not completely justified.

One could now add more constraints $\Phi(\rho_{2i}) = \rho_{2i+1}$, and this would lead to similar conditions for extremality in
terms of the Kraus operators. Note however that the $\rho_i$ appearing in the constraints cannot be chosen completely
arbitrarily, as in general non-compatible constraints can arise due to the complete positivity condition on the physical
maps. (Deciding whether a set of conditions $\Phi(\rho_{2i}) = \rho_{2i+1}$ is physical can be solved using the techniques of semidefinite
programming.)

Let us now formulate another interesting Theorem: 

Theorem 5 Given a Hilbert space of dimension $n$ and a trace-preserving map $\Phi$ of rank $m \le n$, then there exist pure
states $|\psi\rangle$ such that $\Phi(|\psi\rangle\langle\psi|)$ are states of rank $m - 1$.

Proof: Let us first consider the case $m = n$, and define $m$ Kraus operators $\{A_i\}$ corresponding to $\Phi$. Given a pure
state $|\psi\rangle$, then $\Phi$ maps this state to one that is not full rank iff there exists a pure state $|\chi\rangle$ such that

\[ \langle\chi|\Phi(|\psi\rangle\langle\psi|)|\chi\rangle = 0 = \sum_i |\langle\chi|A_i|\psi\rangle|^2 . \]

Writing $|\chi\rangle = \sum_k y_k|k\rangle$, $|\psi\rangle = \sum_j x_j|j\rangle$ and $A^i_{kj} = \langle k|A_i|j\rangle$ (up to complex conjugation of the $y_k$), the previous
equation amounts to solving the following set of bilinear equations:

\[ \forall i = 1:n, \qquad \sum_{k=1}^{n} \Big( \sum_{j=1}^{n} A^i_{kj}\, x_j \Big) y_k = 0 . \]

This set of equations always has a non-trivial solution. Indeed, the parameters $x_j$ can always be chosen such that
the matrix $\tilde A_{ik} = \sum_j x_j A^i_{kj}$ is singular (if all $A_i$ are full rank then this can be done by fixing all but one of them, and
then choosing the remaining parameter such that the determinant vanishes; if one of the $A_i$ is rank deficient then
the solution is of course direct). Then the parameters $y_k$ can be chosen such that the vector $y$ is in the right kernel
of $\tilde A$ (the right kernel is not zero-dimensional as the dimension of the matrix $\tilde A$ is $n \times n$), and therefore $\Phi(|\psi\rangle\langle\psi|)$ is
not full rank. If $m < n$, then the right kernel of $\tilde A$ is at least $n - m + 1$ dimensional, such that $n - m + 1$ linearly
independent $|\chi\rangle$ can be found such that $\langle\chi|\Phi(|\psi\rangle\langle\psi|)|\chi\rangle = 0$, which ends the proof. □

In general, it is thus proven that one can always find states $|\psi\rangle$ such that the rank of $\Phi(|\psi\rangle\langle\psi|)$ is smaller than the
rank of the map, which is surprising. Note that the bound in the Theorem is generically tight, i.e. the minimal rank
of the output state will typically be $m - 1$; this follows from the fact that decreasing the rank of the matrix $\tilde A$ by
two units would need $n(n - 1)/2$ independent degrees of freedom, while there are only $n - 1$ available.

Note that extremal TPCP-maps always fulfil the conditions of the Theorem. In particular, extremal qubit channels
are generically of rank 2, and the previous Theorem implies that there always exist pure states that remain pure after
the action of a rank 2 extremal map. (This was also observed by Ruskai et al.)

The above Theorem also has some consequences for the study of entanglement. Applying the foregoing proof to the
dual state we can easily prove the following: if the rank of a mixed state $\rho$ defined in an $n \times n$ dimensional Hilbert
space is given by $m \le n$, then there always exist at least $(n - m + 1)$ linearly independent product states orthogonal
to it.

Let us now consider an example of the use of extremal maps. Suppose we want to characterize the optimal local 
trace-preserving operations that one has to apply locally to each of the qubits of a 2-qubit entangled mixed state, 
such as to maximize the fidelity (i.e. the overlap with a maximally entangled state). This problem is of interest in the 
context of teleportation, as the fidelity of the state used to teleport is the standard measure of the quality of
teleportation. Badziag and the Horodeckis discovered the intriguing property that the fidelity of a mixed state
can be enhanced by applying an amplitude damping channel to one of the qubits. This is due to the fact that the 
fidelity is both dependent on the quantum correlations and on the classical correlations, and enhancing the classical 
correlations by mixing (and hence losing quantum correlations) can sometimes lead to a higher fidelity. 

With the help of the previous analysis of extremal maps, we are in the right position to find the optimal
trace-preserving map that maximizes the fidelity. Indeed, the optimization problem is to find the trace-preserving CP-maps
$\Phi_A, \Phi_B$ such as to maximize the fidelity $F$ defined as

\[ F(\rho, \Phi_A, \Phi_B) = \langle\psi|\,\Phi_A \otimes \Phi_B(\rho)\,|\psi\rangle = \mathrm{Tr}\Big[ \rho\, \big(\Phi_A^\dagger \otimes \Phi_B^\dagger\,(|\psi\rangle\langle\psi|)\big) \Big] \tag{9} \]

with $|\psi\rangle$ the maximally entangled state. This problem is readily seen to be jointly convex in $\Phi_A$ and $\Phi_B$, and therefore
the optimal strategy will certainly consist of applying extremal (rank 2) maps $\Phi_A, \Phi_B$. As we just have derived an
easy parametrization of these maps, it is easy to devise a numerical algorithm that will yield the optimal solution.

Note that the problem, although convex in $\Phi_A$ and $\Phi_B$, is bilinear and therefore can have multiple (local) maxima.
This problem disappears when only one party (Alice or Bob) applies a map (i.e. $\Phi_B = I$). This problem was
studied in more detail by Rehacek et al. [14], where a heuristic algorithm was proposed to find the optimal local
trace-preserving map to be applied by Bob. As the optimization problem is however convex, the powerful techniques
of semidefinite programming should be applied, for which an efficient algorithm exists that is assured to converge
to the global optimum. Indeed, due to linearity the problem now consists of finding the 2-qubit state $\rho_\Phi \ge 0$ with
constraint $\mathrm{Tr}_B(\rho_\Phi) = I$ such that the fidelity is maximized. As we already know, the algorithm will converge to
a $\rho_\Phi$ of maximal rank 2 in the case of qubits. Exactly the same reasoning holds for systems in higher dimensional
Hilbert spaces: if only one party is to apply a trace-preserving operation to enhance the fidelity, the above semidefinite
program will produce the optimal local map that maximally enhances the fidelity.

Other situations in which extremal maps will be encountered are for example the problem of optimal cloning [15]:
given an unknown input state $\rho$, one wants to construct the optimal trace-preserving CP-map such as to yield
an output for which the fidelity with $\rho \otimes \rho$ is maximal. This can again be rephrased as a semidefinite program whose
unique solution will be given by an extremal trace-preserving CP-map.



III. QUANTUM CHANNELS AND ENTANGLEMENT 



The physical interpretation of the dual state corresponding to a CP-map or quantum channel is straightforward.
It is the density operator that corresponds to the state that can be made as follows: Alice prepares a maximally
entangled state $|I\rangle$, and sends one half of it to Bob through the channel $\Phi$. This results in the state $\rho_\Phi$.

A perfect quantum channel is unitary and the corresponding state $\rho_\Phi$ is a maximally entangled state. This
corresponds to the case of perfect transmission of qudits, and indeed a maximally entangled state is the state with perfect
quantum correlations. Consider now a completely dephasing channel. In that case it is possible to transmit a
classical bit perfectly, and indeed $\rho_\Phi$ corresponds to a separable state with maximal classical correlations. As a third
example, consider the complete amplitude damping channel. Then $\rho_\Phi$ is a product state with no correlations
whatever between Alice and Bob. It is therefore clear that the study of the character of correlation present in the
quantum state $\rho_\Phi$ tells us a lot about the character of the quantum channel.

This way of looking at quantum channels gives a nice way of unifying statics and dynamics in one framework: the 
future is entangled (or at least correlated) with the past. Just as a measurement in the future gives us information 
about the prepared system (through the use of the quantum Bayes rule), a measurement on Bob's side enables Alice 
to refine her knowledge of her local system (through the use of the quantum steering Theorem) [44]. It is therefore
clear that the description of entanglement will shed new light on the question of describing correlations between the
states of the same system at two different instants of time, and vice-versa. Therefore we expect that many useful 
results concerning entanglement can directly be applied to quantum channels. On the other hand, a lot of work has 
been done concerning the quantification of the classical capacity of a quantum channel. These results offer a nice 
starting point for the study of classical correlations present in a quantum state. 



A. Quantum capacity 

The quantum capacity of a quantum channel is related to the asymptotic number of uses of the channel needed for 
obtaining states whose fidelity tends to one. To transmit quantum information with high fidelity, one indeed needs 
almost perfect singlets. It is immediately clear that ideas of entanglement distillation will be crucial: sending one
part of an EPR pair through the channel will result in a mixed state, and these mixed states will have to be purified.

Let us first establish a result that was already implicitly used by many:

Theorem 6 A quantum channel $\Phi$ can be used to distribute entanglement if and only if $\rho_\Phi$ is entangled. If $\rho_\Phi$ is
separable, then the Kraus operators of the map $\Phi$ can be chosen to be projectors, and the map $\Phi$ is entanglement
breaking.

Proof: The if part is obvious, as $\rho_\Phi$ is the state obtained by sending one part of a maximally entangled state through
the channel. To prove the only if part, assume that $\rho_\Phi$ is separable. Then all Kraus operators can be chosen to be
projectors (corresponding to the decomposition with separable pure states), destroying all entanglement. □

It is also possible to make a quantitative statement: 

Theorem 7 Suppose we want to use the channel $\Phi$ to distribute entanglement by sending one part of an entangled
state through the channel. The maximal attainable fidelity (i.e. overlap with a maximally entangled state) corresponds
to the largest eigenvalue of $\rho_\Phi$. This maximal fidelity is obtained if Alice sends one half of the state described by the
eigenvector of $\rho_\Phi$ corresponding to its largest eigenvalue.

Proof: Suppose Alice prepares the entangled state $|\chi\rangle$ and sends the second part to Bob through the channel $\Phi$ with
Kraus operators $\{A_i\}$. We want to find the state $|\chi\rangle$ such that

\[ \langle I| \sum_i (I \otimes A_i)\,|\chi\rangle\langle\chi|\,(I \otimes A_i^\dagger)\,|I\rangle = \langle\chi|\rho_\Phi|\chi\rangle \tag{10} \]

is maximized, which immediately gives the stated result. □
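
A small numerical sketch may make the statement concrete (plain Python; the amplitude damping channel with parameter `g`, the normalization by $1/n$ that converts the unnormalized $|I\rangle$ into a proper fidelity, and all function names are our own assumptions). The fidelity is computed directly from its definition for two real, diagonal input states, for which both sides of equation (10) coincide, and the better input is checked to be an eigenvector of $\rho_\Phi$.

```python
import math

def amplitude_damping(g):
    return [[[1, 0], [0, math.sqrt(1 - g)]], [[0, math.sqrt(g)], [0, 0]]]

def fidelity(chi, kraus, n=2):
    # chi[a*n + b] are the amplitudes of |chi> = sum chi_ab |a>|b>; the second
    # qubit goes through the channel. Overlap with the normalized |I>/sqrt(n).
    total = 0.0
    for A in kraus:
        amp = sum(A[a][b] * chi[a * n + b] for a in range(n) for b in range(n))
        total += abs(amp) ** 2
    return total / n

def dual_state(kraus, n=2):
    # rho_Phi = sum_k |v_k><v_k| with |v_k> = (I ⊗ A_k)|I>, component (i, r) = A_k[r][i].
    vecs = [[complex(A[r][i]) for i in range(n) for r in range(n)] for A in kraus]
    dim = n * n
    return [[sum(v[p] * v[q].conjugate() for v in vecs) for q in range(dim)]
            for p in range(dim)]

def apply(rho, vec):
    return [sum(rho[p][q] * vec[q] for q in range(len(vec))) for p in range(len(vec))]

g = 0.5
kraus = amplitude_damping(g)
rho_phi = dual_state(kraus)

chi_max = [1 / math.sqrt(2), 0, 0, 1 / math.sqrt(2)]        # maximally entangled input
norm = math.sqrt(2 - g)
chi_opt = [1 / norm, 0, 0, math.sqrt(1 - g) / norm]         # candidate top eigenvector

f_max = fidelity(chi_max, kraus)
f_opt = fidelity(chi_opt, kraus)

# Check that chi_opt is an eigenvector of rho_Phi with eigenvalue 2 - g.
image = apply(rho_phi, chi_opt)
residual = max(abs(image[p] - (2 - g) * chi_opt[p]) for p in range(4))

print(f_max, f_opt, residual)
```

With these conventions the eigenvalues of $\rho_\Phi$ are $2 - g$ and $g$, so the attainable fidelity is $(2 - g)/2$, which exceeds the value obtained with a maximally entangled input whenever $0 < g < 1$.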

The above result is amazing: it tells us that it is not always the best strategy to send one part of a maximally
entangled state through the channel. It would be tempting to conjecture that the entanglement of distillation of the
obtained state represents the quantum capacity of the given channel.

Note that the eigenvalues and eigenvectors of $\rho_\Phi$ get an appealing interpretation: these represent the fidelities that
are obtained by sending one half of the eigenvectors through the channel. Note also that the reduction criterion

\[ I \otimes \mathrm{Tr}_2(\rho_\Phi) - \rho_\Phi = \frac{I_{n^2}}{n} - \rho_\Phi \ge 0 \]

(for $\rho_\Phi$ normalized to unit trace) implies that $\rho_\Phi$ is entangled if its largest eigenvalue exceeds $1/n$. This is of course in
complete accordance with the previous Theorem, as the maximal fidelity for a separable state is also given by $1/n$.

A more sophisticated treatment of the quantum capacity of a quantum channel would involve ideas of coding and 
of quantum error correction, although only partial results have been obtained yet; the following is an incomplete list 
of papers where interesting results have been obtained 



B. Classical Capacity 



Let us now move towards the well-studied problem of classical capacity of a quantum channel. The central result
is the Holevo-Schumacher-Westmoreland Theorem [29, 30], which tells us that the classical product state capacity
of a quantum channel $\Phi$ is given by

\[ C(\Phi) = \max_{p_j, \rho_j} \Big[ S\Big(\Phi\Big(\sum_j p_j \rho_j\Big)\Big) - \sum_j p_j\, S\big(\Phi(\rho_j)\big) \Big] . \tag{11} \]
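
To illustrate the quantity inside the maximum of formula (11), the sketch below (plain Python) evaluates it for one fixed ensemble sent through a qubit depolarizing channel. The channel, its shrinking factor `eta`, the Bloch-vector representation of qubit states and the function names are illustrative assumptions of ours; no maximization over ensembles is performed, so the number obtained is only a lower bound on the expression in (11).

```python
import math

def binary_entropy(p):
    # Shannon entropy (in bits) of the distribution (p, 1 - p).
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -p * math.log2(p) - (1 - p) * math.log2(1 - p)

def qubit_entropy(bloch):
    # von Neumann entropy of the qubit state (I + r.sigma)/2: eigenvalues (1 ± |r|)/2.
    r = math.sqrt(sum(x * x for x in bloch))
    return binary_entropy((1 + r) / 2)

def depolarize(bloch, eta):
    # Assumed model: rho -> eta*rho + (1 - eta)*I/2 shrinks the Bloch vector by eta.
    return [eta * x for x in bloch]

def holevo_quantity(ensemble, eta):
    # ensemble: list of (probability, Bloch vector) pairs.
    outputs = [(p, depolarize(r, eta)) for p, r in ensemble]
    average = [sum(p * r[k] for p, r in outputs) for k in range(3)]
    return qubit_entropy(average) - sum(p * qubit_entropy(r) for p, r in outputs)

ensemble = [(0.5, [0, 0, 1]), (0.5, [0, 0, -1])]   # |0> and |1> with equal weights

chi_half = holevo_quantity(ensemble, 0.5)
chi_ideal = holevo_quantity(ensemble, 1.0)
print(chi_half, chi_ideal)
```

For the noiseless case (`eta = 1`) the ensemble carries one full bit, while for `eta = 0.5` the value drops to $1 - h(3/4)$, with $h$ the binary entropy.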
Let us now ask the following question: what would be the analogy and the interpretation of this formula in the dual
picture of states? Using formula (7), it holds that

\[ \Phi(\rho_j) = \mathrm{Tr}_1\!\left( \rho_\Phi^{T_1} (\rho_j \otimes I) \right) . \]

Suppose Alice and Bob share the state $\rho_\Phi$. Then the above formula describes how Bob has to update his local
density operator when Alice did a measurement with corresponding POVM-element $\rho_j$ (up to a transposition of $\rho_j$).
Reasoning along the lines of the HSW-Theorem, the natural interpretation would now be that formula (11) will give us a
measure of how much (secret) classical randomness Alice and Bob can create using the state $\rho_\Phi$: if Alice implements a
POVM measurement with elements $\{p_j \rho_j\}$, this drives the system at Bob's side into a particular direction, and a
measurement of Bob will reveal some information about the (random) outcome of Alice. Note that we interpret the presence
of a bipartite state as being a particular kind of quantum channel. Note that the question of creating shared randomness
has also been discussed in [31], [32].

The foregoing discussion suggests the following definition for the classical random correlations $C^{cl}$ present in a
quantum state $\rho$:

\[ C^{cl}_B(\rho_{AB}) = \max_{\{E_j\}} \Big[ S(\rho_B) - \sum_j p_j\, S(\rho_B^j) \Big] \tag{12} \]
\[ p_j = \mathrm{Tr}\,\rho\,(E_j \otimes I) \tag{13} \]
\[ \rho_B^j = \frac{1}{p_j}\, \mathrm{Tr}_1\big( \rho\,(E_j \otimes I) \big) . \tag{14} \]

Here $\{E_j\}$ represents the elements of the POVM implemented by Alice. Observe that there is an asymmetry in the
definition, in that $C^{cl}_A$ is not necessarily equal to $C^{cl}_B$. This definition coincides with the one given by Henderson
and Vedral, where they introduced this measure because it fulfilled the condition of monotonicity under local
operations.

In general, the classical mutual information obtained by the actions of Alice and Bob to obtain classical randomness
will be smaller than the derived quantity (12), as coding is needed to achieve the Shannon capacity. This coding
could be implemented by doing joint measurements, but we do not expect that the upper bound is tight; a better
rate could be obtained if also public classical communication is allowed (A. Winter, unpublished).



IV. ONE-QUBIT CHANNELS 

In the case of qubit channels, much more explicit results can be obtained, due to the fact that we have a fairly 
good insight into the properties of mixed states of two qubits. In this section we highlight some questions about qubit 
channels that can be solved analytically. 

Recall the formula

$$\Phi(\rho) = \mathrm{Tr}_1\big(\rho_\Phi^{T_1}(\rho \otimes I_n)\big) \qquad (15)$$

which is almost exactly the same expression as if Alice were measuring the POVM-element $\rho$ on the joint state $\rho_\Phi$;
the difference is that the partial transpose of this state has to be taken. It is now natural to look at the R-picture of
the dual state $\rho_\Phi$ associated to the map [34], where $\rho$ is parameterized by a real 4x4 matrix

$$R_{ij} = \mathrm{Tr}\,(\rho\,\sigma_i \otimes \sigma_j),$$

$0 \leq i, j \leq 3$. In the R-representation, a partial transpose corresponds to a multiplication of the third column or row
with a minus sign. Let us therefore define $R_\Phi$ to be the parameterization of $\rho_\Phi^{T_1}$ in the R-picture, i.e. the R-picture of
$\rho_\Phi$ in which the third row is multiplied by $-1$. Note that the first row of $R_\Phi$ is given by [1; 0; 0; 0], as this corresponds
to the trace-preserving condition.
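The statement about the partial transpose can be checked numerically. The sketch below computes the R-matrix of a two-qubit state and verifies that transposing the first factor flips the sign of the row belonging to $\sigma_y$ (the third row when counting from one). The helper names `r_matrix` and `partial_transpose_first` are hypothetical, and the partial transpose is assumed to act on the first tensor factor.

```python
import numpy as np

PAULIS = [
    np.eye(2, dtype=complex),
    np.array([[0, 1], [1, 0]], dtype=complex),
    np.array([[0, -1j], [1j, 0]], dtype=complex),
    np.array([[1, 0], [0, -1]], dtype=complex),
]

def r_matrix(rho):
    """R_ij = Tr(rho sigma_i (x) sigma_j) for a two-qubit operator rho."""
    return np.array([[np.trace(rho @ np.kron(si, sj)).real for sj in PAULIS]
                     for si in PAULIS])

def partial_transpose_first(rho):
    """Transpose the first qubit only: swap its row and column indices."""
    return rho.reshape(2, 2, 2, 2).transpose(2, 1, 0, 3).reshape(4, 4)

# Example: the Bell state (|00> + |11>)/sqrt(2).
phi = np.array([1, 0, 0, 1], dtype=complex) / np.sqrt(2)
bell = np.outer(phi, phi.conj())
print(np.round(r_matrix(bell), 6))                           # diag(1, 1, -1, 1)
print(np.round(r_matrix(partial_transpose_first(bell)), 6))  # identity matrix
```

The sign flip appears because $\sigma_y$ is the only Pauli matrix that is antisymmetric under transposition.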

If $x$ is the Bloch vector corresponding to the input state $\rho$, then the action of the map with corresponding
$\rho_\Phi^{T_1}$ or $R_\Phi$ is the following:

$$\begin{pmatrix} 1 \\ x' \end{pmatrix} = R_\Phi \begin{pmatrix} 1 \\ x \end{pmatrix}. \qquad (16)$$

One can easily prove that the image of the Bloch sphere yields an ellipsoid, where the local density operator of
Alice is represented by the center of the ellipsoid. This implies that the knowledge of the ellipsoid corresponds to the
complete knowledge of the quantum channel up to local unitaries at the input. (Note that not all ellipsoids correspond
to physical maps, but that there is some restriction on the ratios of the axes.)

Let us now consider the analogue of local unitary (LU) and local filtering (SLOCC) equivalence classes as known
for mixed states of two qubits [31]. What we are looking for are normal forms $\Omega$ (where $\Omega$ is a map) such that
$\Phi(\rho) = B\,\Omega(A \rho A^\dagger)\,B^\dagger$ with $A, B \in SU(2)$ or $\in SL(2, \mathbb{C})$.

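The LU case is very easy: each $R_\Phi$ can be brought into the unique form

$$\begin{pmatrix} 1 & 0 & 0 & 0 \\ x & \lambda_1 & 0 & 0 \\ y & 0 & \lambda_2 & 0 \\ z & 0 & 0 & \pm\lambda_3 \end{pmatrix}$$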

by local unitary transformations, where $\lambda_1 \geq \lambda_2 \geq |\lambda_3|$ and $x, y \geq 0$; one just has to take the singular value
decomposition of the lower 3x3 block of R, taking into account that the orthogonal matrices have determinant +1
(see also Fujiwara and Algoet [35] and King and Ruskai [36] for a different approach but with the same result).
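The singular value decomposition with proper rotations can be sketched as follows. This is only an illustration under the assumption that the lower 3x3 block is decomposed as $T = O_{out}\,\mathrm{diag}(\lambda)\,O_{in}^T$; the function name `lu_normal_form` is hypothetical, and the additional sign choices that make $x, y \geq 0$ are not implemented.

```python
import numpy as np

def lu_normal_form(R):
    """Decompose the lower 3x3 block of R as O_out @ diag(lam) @ O_in.T
    with det(O_out) = det(O_in) = +1 and lam[0] >= lam[1] >= |lam[2]|."""
    T = R[1:, 1:]
    U, s, Vt = np.linalg.svd(T)          # singular values in descending order
    O_out, O_in, lam = U.copy(), Vt.T.copy(), s.copy()
    # An improper orthogonal factor is fixed by flipping its last column;
    # the sign is absorbed into the smallest singular value.
    if np.linalg.det(O_out) < 0:
        O_out[:, 2] *= -1
        lam[2] *= -1
    if np.linalg.det(O_in) < 0:
        O_in[:, 2] *= -1
        lam[2] *= -1
    return O_out, lam, O_in

# Example with an arbitrary real 3x3 block.
rng = np.random.default_rng(0)
R = np.eye(4)
R[1:, 1:] = rng.normal(size=(3, 3))
O_out, lam, O_in = lu_normal_form(R)
print(lam)
```

Only the last singular value can end up negative, which is the origin of the $\pm\lambda_3$ entry in the normal form.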

Let us next move to SLOCC equivalence classes; it is clear that the Lorentz singular value decomposition [24] is all
we need:

Theorem 8 Given a 1-qubit trace-preserving CP-map $\Phi$ and its dual $R_\Phi$. Then the SLOCC normal form $\Omega$ of $R_\Phi$
is proportional to one of the following unique normal forms:



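$$\begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & s_1 & 0 & 0 \\ 0 & 0 & s_2 & 0 \\ 0 & 0 & 0 & s_3 \end{pmatrix}, \qquad
\begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & x/\sqrt{3} & 0 & 0 \\ 0 & 0 & x/\sqrt{3} & 0 \\ 2/3 & 0 & 0 & 1/3 \end{pmatrix}, \qquad
\begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 1 & 0 & 0 & 0 \end{pmatrix} \qquad (17)$$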



Here $1 \geq s_1 \geq s_2 \geq |s_3|$, $1 - s_1 - s_2 - s_3 \geq 0$ and $0 \leq x \leq 1$. For maps with a normal form of the first kind, one can
choose the Kraus operators equal to



$$\{A_i\} = \{p_0 A \sigma_0 B,\; p_1 A \sigma_1 B,\; p_2 A \sigma_2 B,\; p_3 A \sigma_3 B\} \qquad (18)$$



with A, B complex 2x2 matrices and $p_i \geq 0$, related to the $\{s_i\}$ by the formula relating the eigenvalues of a Bell
diagonal state to its Lorentz singular values. The Kraus operators of maps with a normal form of the second kind can
be chosen to be of the form



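[Equation (19): a set of three Kraus operators, each consisting of a fixed 2x2 matrix placed between the factors A and B; the matrix entries are not recoverable from this copy.]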



again with A, B complex 2x2 matrices. In the third case, the map is trivial as it maps everything to the same point.
The quantities $\{s_i\}$, $x$, $A$, $B$, $\{p_i\}$ can be calculated explicitly by calculating the Lorentz singular value decomposition of the state $\rho_\Phi$.



Proof: The proof is immediate given the Lorentz singular value decomposition. The first case corresponds to a 
diagonalizable R, and a diagonal R corresponds to a bistochastic channel. The second and third case correspond to 
non-diagonalizable cases (note that there are 2 normal forms in the case of states that do not apply here as they 
cannot lead to trace-preserving channels). □ 

This gives a nice classification of all the classes of TPCP-maps on qubits: the generic class is the one that can be 
brought into unital form by adding appropriate filtering transformations A, B, i.e. the ellipsoid can be continuously 
deformed to an ellipsoid whose center is the maximally mixed state. The non-generic class however cannot be deformed 
in this way: it is easy to show that the ellipsoid corresponding to the normal form touches the Bloch sphere at one 
and only at one point; there is no filtering operation that can change this property. We conclude that the ellipsoids 
in the non-generic case are not (and cannot be made by filtering operations) symmetric around the origin and that 
they touch the Bloch sphere at exactly one point. 
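The touching property can be illustrated numerically. The sketch below assumes the second normal form of Theorem 8 has first column $(1, 0, 0, 2/3)$ and diagonal $(1, x/\sqrt{3}, x/\sqrt{3}, 1/3)$, and that Bloch vectors are mapped as $(1, v') = R\,(1, v)$; the function names are hypothetical.

```python
import numpy as np

def nongeneric_normal_form(x):
    """Second normal form of Theorem 8 (as read here), for 0 <= x <= 1."""
    r = x / np.sqrt(3.0)
    return np.array([[1.0, 0.0, 0.0, 0.0],
                     [0.0, r, 0.0, 0.0],
                     [0.0, 0.0, r, 0.0],
                     [2.0 / 3.0, 0.0, 0.0, 1.0 / 3.0]])

def image_of_bloch_vectors(R, vectors):
    """Apply (1, v') = R (1, v) to every row v of `vectors`."""
    ones = np.ones((len(vectors), 1))
    return (np.hstack([ones, vectors]) @ R.T)[:, 1:]

R = nongeneric_normal_form(0.8)
rng = np.random.default_rng(2)
v = rng.normal(size=(500, 3))
v /= np.linalg.norm(v, axis=1, keepdims=True)        # random pure input states
norms = np.linalg.norm(image_of_bloch_vectors(R, v), axis=1)
pole = image_of_bloch_vectors(R, np.array([[0.0, 0.0, 1.0]]))
print(norms.max(), pole)                             # only the pole reaches norm 1
```

In this parameterization the squared output norm is bounded by $(7 + 4z - 2z^2)/9$ for an input with third Bloch component $z$, which equals 1 only at $z = 1$.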

We depict both types of normal ellipsoids in figure 1. Note that this geometrical picture will be very useful in
guessing input states that maximize the classical capacity of the channel (see e.g. [36]).



A. Extremal maps for qubits 



In the case of a qubit channel the dual state $\rho_\Phi$ is a mixed state of two qubits. It is possible to obtain an explicit
parameterization of all extremal qubit maps (see also Ruskai et al. [10] for a different approach):

FIG. 1: The image of a channel in generic normal form (left) or in non-generic normal form (right).



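Theorem 9 The set of dual states $\rho_\Phi$ corresponding to extreme points of the set of completely positive trace preserving
maps $\Phi$ on 1 qubit is given by the union of all maximally entangled pure states, and all rank 2 states $\rho_\Phi$ for which
$\mathrm{Tr}_2(\rho_\Phi)$ is equal and $\mathrm{Tr}_1(\rho_\Phi)$ is not equal to the identity. The Kraus operators corresponding to the rank 1 extreme
points are unitary, while the ones corresponding to the rank 2 extreme points have a representation of the form:

$$A_1 = U \begin{pmatrix} s_0 & 0 \\ 0 & s_1 \end{pmatrix} V^\dagger, \qquad
A_2 = U \begin{pmatrix} 0 & \sqrt{1 - s_1^2} \\ \sqrt{1 - s_0^2} & 0 \end{pmatrix} V^\dagger \qquad (20)$$

with U, V unitary.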

Proof: We have already proven that extremal TPCP-maps have maximal rank 2. Due to the duality between maps
and states, it is sufficient to consider rank 2 density operators of two qubits $\rho_\Phi$ for which $\mathrm{Tr}_2(\rho_\Phi) = I_2$. A real
parameterization of all 2-qubit density operators $\rho$ is given by the real 4x4 matrix R with coefficients

$$R_{ij} = \mathrm{Tr}\,(\rho\,\sigma_i \otimes \sigma_j) \qquad (21)$$

where $0 \leq i, j \leq 3$. An appropriate choice of local unitary bases can always make the $R_{1:3,1:3}$ block diagonal, and the
trace-preserving condition translates into $R_{0,1:3} = 0$. Therefore R is given by:

$$R = \begin{pmatrix} 1 & 0 & 0 & 0 \\ t_1 & \lambda_1 & 0 & 0 \\ t_2 & 0 & \lambda_2 & 0 \\ t_3 & 0 & 0 & \lambda_3 \end{pmatrix}.$$

The corresponding $\rho$ is given by

$$\rho = \frac{1}{4} \sum_{i,j=0}^{3} R_{ij}\, \sigma_i \otimes \sigma_j,$$

and the positivity of $\rho$ constrains the allowed range of the 6 parameters. Let us now impose that the rank of the
corresponding $\rho$ is 2. This implies that linear combinations of 3 x 3 minors of $\rho$ be zero, and after some algebra one
obtains the following conditions:

$$t_3(\lambda_3 + \lambda_1 \lambda_2) = 0$$
$$t_2(\lambda_2 + \lambda_1 \lambda_3) = 0$$
$$t_1(\lambda_1 + \lambda_2 \lambda_3) = 0$$

These equations, supplemented with the fact that diagonal elements of a positive semidefinite matrix are always bigger
than the elements in the same column, lead to the conclusion that all $t_i$ but one have to be equal to zero if $\rho$ is rank
2. Without loss of generality, we can choose $t_1 = t_2 = 0$ and parameterize $\lambda_1 = \cos(\alpha)$, $\lambda_2 = \cos(\beta)$. We thus arrive
at the canonical form

$$R = \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & \cos\alpha & 0 & 0 \\ 0 & 0 & \cos\beta & 0 \\ \sin\alpha \sin\beta & 0 & 0 & -\cos\alpha \cos\beta \end{pmatrix}. \qquad (22)$$

Suppose that $\sin(\alpha)\sin(\beta) = 0$ (this condition is equivalent to $\mathrm{Tr}_1(\rho_\Phi) = I/2$). Then the state corresponding to this
R is Bell-diagonal and thus a convex sum of two maximally entangled states, and therefore the map corresponding
to this state cannot be extremal. In the other case, an extremal rank 2 TPCP-map is obtained, which can easily be
shown to yield the given Kraus representation, where $s_0 = \sqrt{(1 - \cos(\alpha + \beta))/2}$ and $s_1 = \sqrt{(1 - \cos(\alpha - \beta))/2}$. □
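The link between the angles $(\alpha, \beta)$ and a rank 2 Kraus pair can be checked numerically. The sketch below is an illustration under explicit assumptions, not the authors' derivation: it takes $U = V = I$, assumes the Kraus pair is $\mathrm{diag}(s_0, s_1)$ together with the off-diagonal matrix with entries $\sqrt{1 - s_1^2}$ and $\sqrt{1 - s_0^2}$, and assumes that the R-picture of the dual state is obtained from the affine Bloch-vector matrix of the channel by flipping the sign of the $\sigma_y$ row. All function names are hypothetical.

```python
import numpy as np

PAULIS = [
    np.eye(2, dtype=complex),
    np.array([[0, 1], [1, 0]], dtype=complex),
    np.array([[0, -1j], [1j, 0]], dtype=complex),
    np.array([[1, 0], [0, -1]], dtype=complex),
]

def extremal_kraus(alpha, beta):
    """Assumed rank 2 Kraus pair with U = V = I."""
    s0 = np.sqrt((1 - np.cos(alpha + beta)) / 2)
    s1 = np.sqrt((1 - np.cos(alpha - beta)) / 2)
    a1 = np.array([[s0, 0], [0, s1]], dtype=complex)
    a2 = np.array([[0, np.sqrt(1 - s1**2)], [np.sqrt(1 - s0**2), 0]], dtype=complex)
    return [a1, a2]

def affine_matrix(kraus):
    """M_ij = Tr(sigma_i Phi(sigma_j)) / 2, so that (1, x') = M (1, x)."""
    M = np.zeros((4, 4))
    for j, sj in enumerate(PAULIS):
        out = sum(a @ sj @ a.conj().T for a in kraus)
        for i, si in enumerate(PAULIS):
            M[i, j] = 0.5 * np.trace(si @ out).real
    return M

alpha, beta = 0.4, 1.1
M = affine_matrix(extremal_kraus(alpha, beta))
state_picture = np.diag([1.0, 1.0, -1.0, 1.0]) @ M   # undo the sigma_y sign flip
print(np.round(state_picture, 6))
```

With these assumptions the printed matrix has diagonal $(1, \cos\alpha, \cos\beta, -\cos\alpha\cos\beta)$ and a single off-diagonal entry $\sin\alpha\sin\beta$ in the first column of the last row.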

Note that the corresponding Theorem for bistochastic qubit channels is not very useful, as extremal bistochastic qubit
channels are always unitary. Theorem 9 however is very interesting, and indicates that there always exist pure states
that remain pure after the action of the extremal qubit channel: indeed, if the basis vectors $\{|i\rangle\}$ are chosen according
to the unitary V in (20), then it is easily checked that the states $|\psi\rangle \simeq \sqrt{s_1\sqrt{1 - s_1^2}}\,|0\rangle \pm \sqrt{s_0\sqrt{1 - s_0^2}}\,|1\rangle$ remain pure by the
action of the extremal map. Note that these two states are the only ones with this property, and note also that they
are not orthogonal to each other.
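A short numerical check of the purity-preserving inputs, again under stated assumptions: $U = V = I$, a rank 2 Kraus pair given by $\mathrm{diag}(s_0, s_1)$ and the off-diagonal matrix with entries $\sqrt{1 - s_1^2}$, $\sqrt{1 - s_0^2}$, and candidate states with amplitudes $\sqrt{s_1\sqrt{1 - s_1^2}}$ and $\pm\sqrt{s_0\sqrt{1 - s_0^2}}$. The function names are hypothetical.

```python
import numpy as np

def extremal_kraus_from_s(s0, s1):
    """Assumed rank 2 Kraus pair with U = V = I."""
    a1 = np.array([[s0, 0.0], [0.0, s1]])
    a2 = np.array([[0.0, np.sqrt(1 - s1**2)], [np.sqrt(1 - s0**2), 0.0]])
    return [a1, a2]

def output_purity(kraus, psi):
    """Tr(rho_out^2) for the pure input |psi><psi|."""
    rho_out = sum(np.outer(a @ psi, (a @ psi).conj()) for a in kraus)
    return float(np.trace(rho_out @ rho_out).real)

def purity_preserving_states(s0, s1):
    """The two candidate inputs, normalized."""
    a = np.sqrt(s1 * np.sqrt(1 - s1**2))
    b = np.sqrt(s0 * np.sqrt(1 - s0**2))
    return [np.array([a, sign * b]) / np.hypot(a, b) for sign in (1.0, -1.0)]

s0, s1 = 0.8, 0.3
kraus = extremal_kraus_from_s(s0, s1)
plus, minus = purity_preserving_states(s0, s1)
print(output_purity(kraus, plus), output_purity(kraus, minus))
print(abs(plus @ minus))                            # nonzero overlap
print(output_purity(kraus, np.array([1.0, 0.0])))   # a generic input gets mixed
```

The output stays pure exactly when the two vectors $A_1|\psi\rangle$ and $A_2|\psi\rangle$ are parallel, which is the condition these amplitudes solve.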



B. Quantum capacity 

Let us now move on to the relation between 1-qubit quantum channels and entanglement. We can now make use
of the plethora of results derived for mixed states of two qubits. Let us first consider Theorem 5 about entanglement
breaking channels. In the case of mixed states of two qubits, a state is entangled iff it violates the reduction criterion
$I \otimes \rho_B - \rho \geq 0$. But in the case of the dual state $\rho_\Phi$, it holds that $\rho_B = I/2$, and therefore it holds that a quantum
channel $\Phi$ can be used to distribute entanglement iff the maximal eigenvalue of $\rho_\Phi$ exceeds 1/2. In the light of
Theorem 7 it follows that such a non-entanglement breaking channel can always be used to distribute an entangled
state with fidelity larger than 1/2, which in turn implies that it can be used to distill entanglement [18].
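The eigenvalue criterion is easy to try on a concrete family. The sketch below uses the depolarizing channel $\rho \mapsto p\rho + (1 - p)I/2$ as an example chosen here (it is not discussed in the text) and builds the dual state, normalized to unit trace, by sending half of a Bell state through the channel; the function names are hypothetical.

```python
import numpy as np

PAULIS = [
    np.eye(2, dtype=complex),
    np.array([[0, 1], [1, 0]], dtype=complex),
    np.array([[0, -1j], [1j, 0]], dtype=complex),
    np.array([[1, 0], [0, -1]], dtype=complex),
]

def dual_state(kraus):
    """Unit-trace dual state: the channel applied to one half of a Bell state."""
    phi = np.array([1, 0, 0, 1], dtype=complex) / np.sqrt(2)
    rho = np.zeros((4, 4), dtype=complex)
    for a in kraus:
        v = np.kron(np.eye(2), a) @ phi
        rho += np.outer(v, v.conj())
    return rho

def distributes_entanglement(kraus):
    """True iff the largest eigenvalue of the dual state exceeds 1/2."""
    return bool(np.linalg.eigvalsh(dual_state(kraus)).max() > 0.5)

def depolarizing_kraus(p):
    """Kraus operators of rho -> p rho + (1 - p) I/2."""
    return ([np.sqrt((1 + 3 * p) / 4) * PAULIS[0]]
            + [np.sqrt((1 - p) / 4) * s for s in PAULIS[1:]])

for p in (0.2, 0.5, 0.9):
    print(p, distributes_entanglement(depolarizing_kraus(p)))
```

For this family the largest eigenvalue is $(1 + 3p)/4$, so the threshold sits at $p = 1/3$.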

Consider now an entanglement breaking channel, i.e. a channel for which $\rho_\Phi$ is separable. In this case all the Kraus
operators can be chosen to be projectors. An explicit way of calculating this Kraus representation exists. Indeed, in 
the section about entanglement of formation of two qubits, a constructive way of decomposing a separable mixed state 
of two qubits as a convex combination of separable pure states was given. It was furthermore proven that a separable 
state of rank 2 or 4 can always be written as a convex combination of 2 respectively 4 separable pure states, thus giving 
rise to 2 respectively 4 rank one Kraus operators. Surprisingly, most separable rank 3 mixed states of two qubits 
can only be written as a convex combination of 4 separable pure states. This implies that a generic entanglement 
breaking channel of rank 3 needs 4 Kraus operators if these are to be chosen rank 1. Let us also mention that the 
set of separable states is not of measure zero, implying that the set of entanglement breaking channels is also not of 
measure zero. 

The results of Wootters [37] can of course also be applied to non-entanglement-breaking channels. A direct
application of the formalism of Wootters yields the following Theorem:

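Theorem 10 Given a 1-qubit channel $\Phi$ and the state $\rho_\Phi$ associated to it. If C is the concurrence of $\rho_\Phi$, then the
channel has a Kraus representation of the form:

$$\Phi(\rho) = \sum_i p_i\,(U_i \Sigma V_i)\,\rho\,(U_i \Sigma V_i)^\dagger \qquad (23)$$

$$\Sigma = \frac{1}{2} \begin{pmatrix} \sqrt{1 + C} + \sqrt{1 - C} & 0 \\ 0 & \sqrt{1 + C} - \sqrt{1 - C} \end{pmatrix} \qquad (24)$$

where $U_i$, $V_i$ are unitary matrices.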

Proof: The Theorem is a direct consequence of the fact that a mixed state with concurrence C can be written as a 
convex sum of pure states all with concurrence equal to C. □ 
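The diagonal entries $(\sqrt{1 + C} \pm \sqrt{1 - C})/2$ are the Schmidt coefficients of a normalized two-qubit pure state with concurrence C. The sketch below checks this for one value of C; the function names are hypothetical, and the pure-state concurrence is computed with the standard expression $|\psi^T (\sigma_y \otimes \sigma_y)\, \psi|$.

```python
import numpy as np

SY = np.array([[0, -1j], [1j, 0]])

def schmidt_coefficients(c):
    """Schmidt coefficients of a two-qubit pure state with concurrence c."""
    plus = 0.5 * (np.sqrt(1 + c) + np.sqrt(1 - c))
    minus = 0.5 * (np.sqrt(1 + c) - np.sqrt(1 - c))
    return plus, minus

def pure_state_concurrence(psi):
    """|psi^T (sigma_y (x) sigma_y) psi| for a normalized two-qubit vector."""
    return float(abs(psi @ np.kron(SY, SY) @ psi))

c = 0.6
lp, lm = schmidt_coefficients(c)
psi = np.array([lp, 0, 0, lm], dtype=complex)   # state in its Schmidt basis
print(np.vdot(psi, psi).real, pure_state_concurrence(psi))
```

The two coefficients satisfy $\lambda_+^2 + \lambda_-^2 = 1$ and $2\lambda_+\lambda_- = C$, which is all the check relies on.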

The geometrical meaning in the context of channels is the following: each trace-preserving CP-map is a convex 
combination of contractive maps in unique different directions, where each contraction has the same magnitude. 

Let us next address the question of calculating the quantum capacity of the one-qubit channel. Clearly, Theorem
13 tells us what states to send through the channel so as to maximize the fidelity of the shared entangled states. In
general, the quantum capacity cannot be calculated as we do not even have a way of calculating the entanglement of
distillation of mixed states of two qubits (which is a simpler problem).

In the case of unital channels of rank 2 however, the eigenvectors of $\rho_\Phi$ are maximally entangled and the quantum
capacity can be calculated explicitly:

Theorem 11 Consider a bistochastic qubit channel of rank 2. Then its quantum capacity is given by $C_q = 1 - H(p)$,
where p is the maximal eigenvalue of $\rho_\Phi$ and $H(p) = -p \log_2(p) - (1 - p) \log_2(1 - p)$.

Proof: A unital qubit channel exhibits the nice property that no loss whatever occurs by sending a maximally entangled
state through the channel: it can easily be shown (see Bennett et al.) that sending a quantum system through
the channel is equivalent to using the standard teleportation channel induced by the (non-maximally entangled) state $\rho_\Phi$.
Because we can use the state $\rho_\Phi$, obtained by sending a Bell state through the channel, to perfectly simulate the
channel, this is clearly the optimal thing to do, and the quantum capacity of the channel is therefore equal to the
distillable entanglement of $\rho_\Phi$. Now Rains has proven that the distillable entanglement of a Bell diagonal state
of rank 2 is given by $E_{dist}(\rho) = 1 - S(\rho)$, which ends the proof of the Theorem. □
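The formula of Theorem 11 can be evaluated for a simple rank 2 bistochastic channel. The sketch below uses the phase-flip channel $\rho \mapsto p\rho + (1 - p)\sigma_z \rho \sigma_z$ as an example chosen here; the function names are hypothetical, the dual state is normalized to unit trace, and the code does not verify that its input really is bistochastic of rank 2.

```python
import numpy as np

def binary_entropy(p):
    """H(p) in bits, with the convention 0 log 0 = 0."""
    p = min(max(p, 0.0), 1.0)
    if p in (0.0, 1.0):
        return 0.0
    return float(-p * np.log2(p) - (1 - p) * np.log2(1 - p))

def dual_state(kraus):
    """Unit-trace dual state: the channel applied to one half of a Bell state."""
    phi = np.array([1, 0, 0, 1], dtype=complex) / np.sqrt(2)
    rho = np.zeros((4, 4), dtype=complex)
    for a in kraus:
        v = np.kron(np.eye(2), a) @ phi
        rho += np.outer(v, v.conj())
    return rho

def capacity_rank2_bistochastic(kraus):
    """1 - H(p) with p the largest eigenvalue of the dual state (Theorem 11)."""
    p_max = float(np.linalg.eigvalsh(dual_state(kraus)).max())
    return 1.0 - binary_entropy(p_max)

def phase_flip_kraus(p):
    sz = np.diag([1.0, -1.0]).astype(complex)
    return [np.sqrt(p) * np.eye(2, dtype=complex), np.sqrt(1 - p) * sz]

for p in (0.5, 0.9, 1.0):
    print(p, capacity_rank2_bistochastic(phase_flip_kraus(p)))
```

At $p = 1/2$ the value drops to zero and at $p = 1$ the full qubit is transmitted, as one expects for this channel.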

More generally, the quantum capacity of a bistochastic qubit channel is always equal to the entanglement of distillation
of the corresponding dual state (due to the arguments in the previous proof).

As a last remark, we observe that the channels of the non-generic kind that touch the Bloch sphere at exactly one
point are never entanglement-breaking: this follows from the fact that the concurrence of $\rho_\Phi$ always exceeds 0 in that
case.



C. Classical capacity 

Far more progress has been made concerning the classical capacity of quantum channels: it is known that the
classical capacity using product inputs is given by the Holevo $\chi$ quantity. Here the geometrical picture derived in
section 6.4 can sharpen our intuition. Consider for example the case of a unital channel. It is immediately clear that
the Holevo $\chi$ quantity will be maximized by choosing a mixture of two states that lie at opposite ends of the major axis of
the ellipsoid. This implies that the optimal input states are orthogonal. King and Ruskai [36, 39] even proved that
entangled inputs cannot help in the case of unital channels, and we conclude that the classical capacity of the unital
channels is completely understood.
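The geometric argument for unital channels can be made concrete with Bloch vectors. The sketch below assumes a unital channel in diagonal form, $x' = Tx$ with $T = \mathrm{diag}(\lambda_1, \lambda_2, \lambda_3)$, and compares the Holevo quantity of equal mixtures of antipodal inputs along each axis; the function names and the numerical values are illustrative choices made here.

```python
import numpy as np

def binary_entropy(p):
    """H(p) in bits, with the convention 0 log 0 = 0."""
    p = min(max(p, 0.0), 1.0)
    if p in (0.0, 1.0):
        return 0.0
    return float(-p * np.log2(p) - (1 - p) * np.log2(1 - p))

def qubit_entropy(bloch):
    """Entropy of the qubit state with the given Bloch vector."""
    r = min(float(np.linalg.norm(bloch)), 1.0)
    return binary_entropy((1 + r) / 2)

def holevo_chi(T, t, inputs, probs):
    """S(average output) - average output entropy for the map x -> T x + t."""
    outputs = inputs @ T.T + t
    average = probs @ outputs
    return qubit_entropy(average) - sum(p * qubit_entropy(v)
                                        for p, v in zip(probs, outputs))

T = np.diag([0.9, 0.5, 0.4])       # unital channel: ellipsoid centered at the origin
t = np.zeros(3)
probs = np.array([0.5, 0.5])
for axis in range(3):
    e = np.eye(3)[axis]
    print(axis, holevo_chi(T, t, np.array([e, -e]), probs))
```

Along an axis with contraction $\lambda$ the value is $1 - H((1 + |\lambda|)/2)$, so the major axis gives the largest of the three.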

Consider however a non-unital channel of the generic kind. As proven before, this channel can be interpreted as the 
succession of a filter, a unital channel, and another filter. The critical source of noise or decoherence and irreversibility 
in a channel is the mixing, and the previous analysis tells us that this mixing can always be interpreted to happen in 
a unital way, whereas the in- and output of the unital channel is reversibly but non-orthogonally filtered. It follows 
that orthogonal inputs will not appear orthogonally in the unital channel, and typically orthogonal inputs will not 
achieve capacity. This strange fact was indeed discovered by Fuchs [40], and it appears to be generic for non-unital
channels. 

Let us now have a look at the non-generic family of channels, whose ellipsoids touch the Bloch sphere at exactly
one point. It happens that the so-called stretched channel belongs to this family, and this channel has the property
that its (product) capacity is only achieved for an input ensemble with three states [41]. This is surprising but not too
surprising given the geometrical picture, as one of the input states corresponds to the pure output state, while the
other two are chosen to lie symmetrically around the axis connecting the maximally mixed state with the pure
output state. Note however that most of the non-generic channels achieve capacity with 2 input states.

Let us now move to calculate the classical capacity of the extremal qubit channels. In the case of extremal qubit
channels, it is possible to reduce the problem of calculating the classical (Holevo) capacity to an optimization problem
over the ensemble average. The problem to be solved is as follows: find the optimal ensemble $\{p_i, \rho_i\}$ such that

$$S\Big(\Phi\Big(\sum_i p_i \rho_i\Big)\Big) - \sum_i p_i\, S\big(\Phi(\rho_i)\big)$$

is maximized. We assume that $\Phi$ is rank 2 and therefore has a Kraus representation of the form (20). It is clear that
only pure states $\{\rho_i\}$ have to be considered. It is easily seen that in the case of qubits, the entropy of a state is a convex
monotonically increasing function of the determinant of the density operator: $S(\rho) = H\big(\tfrac{1}{2}(1 - \sqrt{1 - 4\det(\rho)})\big)$
with $H(p) = -p \log(p) - (1 - p) \log(1 - p)$ the Shannon entropy function. Inspired by the analysis of 2-qubit channels
by Uhlmann in terms of anti-linear operators [42], we make the following observation:

$$\det\big(A_1 |\psi\rangle\langle\psi| A_1^\dagger + A_2 |\psi\rangle\langle\psi| A_2^\dagger\big) = \tfrac{1}{4}\,\big|\psi^T (A_1^T \sigma_y A_2 - A_2^T \sigma_y A_1)\,\psi\big|^2. \qquad (25)$$

Here $\psi$ is the vector notation (in the computational basis) of $|\psi\rangle$, and $\sigma_y$ is a Pauli matrix. Suppose now that we
add an additional constraint to the problem, namely that the ensemble average $\rho$ is given. Taking a square root
X of $\rho = XX^\dagger$, all possible pure state decompositions can be written as $X' = XU$ with U an arbitrary isometry
(note that the columns of XU represent all unnormalized pure states in the decomposition). With this additional
constraint, the problem can be solved exactly as we solved the entanglement of formation problem. A constructive
way of obtaining the optimal decomposition of $\rho$ is as follows: take a square root X of $\rho$, and calculate the singular
value decomposition of the symmetric matrix $X^T (A_1^T \sigma_y A_2 - A_2^T \sigma_y A_1) X = V \Sigma V^T$. Call $C = \sigma_1 - \sigma_2$ the concurrence
with $\{\sigma_i\}$ the singular values of the above symmetric matrix. Then the optimal decomposition is obtained by
choosing $U = V^* O$ with O the real orthogonal matrix that is chosen such that the diagonal entries of the matrix
$R = O^T(\mathrm{Diag}[\sigma_1, -\sigma_2] - C\rho)\,O$ vanish. For given ensemble average $\rho$, the classical capacity is therefore given by the
following formula: $S(\Phi(\rho)) - f(C)$ (see also Uhlmann [42]).
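The determinant identity behind this construction, written with explicit normalization as $\det(A_1|\psi\rangle\langle\psi|A_1^\dagger + A_2|\psi\rangle\langle\psi|A_2^\dagger) = \tfrac{1}{4}|\psi^T(A_1^T\sigma_y A_2 - A_2^T\sigma_y A_1)\psi|^2$, can be verified numerically for arbitrary 2x2 matrices. The sketch below also evaluates $C = \sigma_1 - \sigma_2$ for a given ensemble average; it does not construct the optimal decomposition, and the function names are hypothetical.

```python
import numpy as np

SY = np.array([[0, -1j], [1j, 0]])

def symmetric_form(a1, a2):
    """The complex symmetric matrix A1^T sigma_y A2 - A2^T sigma_y A1."""
    return a1.T @ SY @ a2 - a2.T @ SY @ a1

def output_determinant(a1, a2, psi):
    """det of A1|psi><psi|A1^dag + A2|psi><psi|A2^dag."""
    u, v = a1 @ psi, a2 @ psi
    out = np.outer(u, u.conj()) + np.outer(v, v.conj())
    return float(np.linalg.det(out).real)

def channel_concurrence(a1, a2, rho):
    """C = sigma_1 - sigma_2 of X^T M X, with X a square root of rho."""
    w, vecs = np.linalg.eigh(rho)
    X = vecs * np.sqrt(np.clip(w, 0.0, None))      # X X^dag = rho
    sig = np.linalg.svd(X.T @ symmetric_form(a1, a2) @ X, compute_uv=False)
    return float(sig[0] - sig[1])

rng = np.random.default_rng(3)
a1 = rng.normal(size=(2, 2)) + 1j * rng.normal(size=(2, 2))
a2 = rng.normal(size=(2, 2)) + 1j * rng.normal(size=(2, 2))
psi = rng.normal(size=2) + 1j * rng.normal(size=2)
psi /= np.linalg.norm(psi)

lhs = output_determinant(a1, a2, psi)
rhs = 0.25 * abs(psi @ symmetric_form(a1, a2) @ psi) ** 2
print(lhs, rhs)
print(channel_concurrence(a1, a2, np.outer(psi, psi.conj())), 2 * np.sqrt(lhs))
```

For a pure ensemble average the second singular value vanishes, so C reduces to $|\psi^T(A_1^T\sigma_y A_2 - A_2^T\sigma_y A_1)\psi|$, i.e. twice the square root of the output determinant.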

To derive an explicit formula for the classical capacity of the extremal channels, we still have to do an optimization
over all possible ensemble averages $\rho$. Note that the previous analysis already taught us that the capacity will always
be reached with an ensemble of two input states. Both the terms $S(\Phi(\rho))$ and C can easily be extremized separately,
but unfortunately even if the eigenvalues of $\rho$ are fixed, the optimal eigenvectors for maximizing $S(\Phi(\rho))$ and minimizing
C are not compatible. However, the capacity can easily be calculated numerically, as it is just an optimization problem
over three real parameters.

On the other hand, we have seen that the definition of the classical capacity had a direct counterpart in giving an
appealing definition for the amount of classical correlations present in a (mixed) bipartite state, $C_{cl}$ (see (12)). The
techniques used in the foregoing paragraph are perfectly adequate to give an exact expression of this quantity if the
shared quantum state is a rank 2 bipartite state $\rho$ of qubits. Indeed, a mixed bipartite state of two qubits can just
be seen as a more general kind of quantum channel.

V. CONCLUSION 

We have shown that the natural description of quantum channels or positive linear maps is given by a dual quantum 
state associated to the map. This dual state is defined over a Hilbert space that is naturally endowed with a tensor 
product structure of the in- and output of the channel. We showed that the techniques developed in the context of 
entanglement are of direct use in describing positive maps. We derived a characterization of the extreme points of the 
convex set of trace-preserving completely positive maps, and gave some generalizations. We discussed some new results 
about the classical and quantum capacity of a quantum channel, and in the case of one-qubit channels we showed
how to exploit the duality between qubit channels and mixed states of two qubits to obtain useful parameterizations.

Actually, Choi derived the different problem of characterizing the extremal points of the (not necessarily trace-preserving)
CP-maps that leave the identity unaffected, but his arguments are readily translated to the present situation. Note also 
that his proof was much more involved. 

In some sense one could argue that this was expected due to the fact that space and time play analogous roles in the
theory of relativity. It is very nice however that in the non-relativistic case considered here, the duality is already present.
This gives hope that it should be possible to generalize the current findings to the relativistic case.